Interact, Embed, and EnlargE: Boosting Modality-Specific Representations for Multi-Modal Person Re-identification
نویسندگان
چکیده
Multi-modal person Re-ID introduces more complementary information to assist the traditional task. Existing multi-modal methods ignore importance of modality-specific in feature fusion stage. To this end, we propose a novel method boost representations for Re-ID: Interact, Embed, and EnlargE (IEEE). First, cross-modal interacting module exchange useful between different modalities extraction phase. Second, relation-based embedding enhance richness descriptors by global into fine-grained local information. Finally, margin loss force network learn each modality enlarging intra-class discrepancy. Superior performance on dataset RGBNT201 three constructed datasets validate effectiveness proposed compared with state-of-the-art approaches.
منابع مشابه
Learning Affine Hull Representations for Multi-Shot Person Re-Identification
We consider the person re-identification problem, assuming the availability of a sequence of images for each person, commonly referred to as video-based or multi-shot reidentification. We approach this problem from the perspective of learning discriminative distance metric functions. While existing distance metric learning methods typically employ the average feature vector as the data exemplar...
متن کاملExploiting Dissimilarity Representations for Person Re-identification
Person re-identification is the task of recognizing an individual that has already been observed over a network of video-surveillance cameras. Methods proposed in literature so far addressed this issue as a classical matching problem: a descriptor is built directly from the view of the person, and a similarity measure between descriptors is defined accordingly. In this work, we propose a genera...
متن کاملSupplementary Material for “RGB-Infrared Cross-Modality Person Re-Identification”
This supplementary material accompanies the paper “RGB-Infrared Cross-Modality Person Re-Identification”. It includes more details of Section 4, as well as extra evaluations of our proposed deep zero-padding method. 1. Details of Counting Domain-Specific Nodes In the third paragraph of Section 4.2 in the main manuscript, we quantify the number of domain-specific nodes in the trained network in ...
متن کاملMulti-Channel Pyramid Person Matching Network for Person Re-Identification
In this work, we present a Multi-Channel deep convolutional Pyramid Person Matching Network (MC-PPMN) based on the combination of the semantic-components and the colortexture distributions to address the problem of person reidentification. In particular, we learn separate deep representations for semantic-components and color-texture distributions from two person images and then employ pyramid ...
متن کاملFast person re-identification based on dissimilarity representations
Person re-identification is a recently introduced computer vision task that consists of recognising an individual who was previously observed over a video-surveillance camera network. Among the open problems, in this paper we focus on computational complexity. Despite its practical relevance, especially in real-time applications, this issue has been overlooked in the literature so far. In this ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i3.20165